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Abstract 

The major enzyme responsible for the glucuronidation of bilirubin is the uridine 5'-diphosphoglucose glucuronosyl- 
transferase A1 (UGT1A1) enzyme, and genetic variation in the UGT1A1 gene is reported to influence the bilirubin 
concentration in the blood. In this study, we have investigated which gene-/haplotype variants may be useful for genetic 
testing of Gilbert's syndrome. Two groups of samples based on serum bilirubin concentrations were obtained from the 
Nordic Reference Interval Project Bio-bank and Database (NOBIDA): the 150 individuals with the highest bilirubin 
(>1 7.5 |imol/L) and the 150 individuals with normal bilirubin concentrations (<1 7.5 |xmol/L). The individuals were 
examined for the TA 6 >TA 7 variant in the UGT1A1 promoter and 7 tag-SNPs in an extended promoter region of UGT1A1 
(haplotype analysis) and in selected SNPs in candidate genes (SLC01B3, ABCC2 and NUP153). We found significant odds 
ratios for high bilirubin level for all the selected UGT1A1 variants. However, in stepwise multivariate logistic regression 
analysis of all genetic variants together with age, sex, country of origin and fasting time, the repeat variants of UGT1A1 
TA 6 >TA 7 and SLC01B3 rs21 17032 T>C were the only variants significantly associated with higher bilirubin concentrations. 
Most individuals with high bilirubin levels were homozygous for the TA 7 -repeat (74%) while only 3% were homozygous for 
the TA 7 -repeat in individuals with normal bilirubin levels. Among individuals heterozygous for the TA 7 -repeat, a low 
frequent UGT1 Al-diplotype harboring the rs7564935 G-variant was associated with higher bilirubin levels. In conclusion, our 
results demonstrate that in testing for Gilbert's syndrome, analyzing for the homozygous TA 7 /TA 7 -genotype would be 
appropriate. 
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Introduction 

High levels of bilirubin in serum or plasma may indicate 
increased degradation of hemoglobin (hemolytic disease), reduced 
transport of bilirubin into the liver, reduced glucoronidation of 
bilirubin or be a sign of hepato-biliary disease. The major enzyme 
responsible for the glucuronidation of bilirubin is the uridine 5'- 
diphosphoglucose glucuronosyltransferase Al (UGT1A1) enzyme 
and genetic variation in the UGT1A1 gene is reported to influence 
the serum/plasma bilirubin concentration [1]. The most common 
variant in UGT1A1 is a TA -insertion (TA B >TA 7 ) in the 
promoter of UGT1A1, which is seen in individuals with low 
grade hyperbilirubinemias (Gilbert's syndrome, also called Gilbert- 
Meulengracht syndrome) [1—4]. In individuals with moderately 
increased serum bilirubin concentration it is important to 
determine whether this increase is due to genetic variations in 
UGT1A1 or other genes, or is caused by liver disease. Use of 
genetic testing for Gilbert's syndrome has been suggested [5] . This 
requires sufficient knowledge on which gene variants to test for as 
well as data on prevalence and penetrance of the gene variants 



included in the testing. At present, satisfactory data are only 
available for the prevalence of a few specific gene variants. Here, 
we present additional data and put forward a proposal on which 
genes and which gene variants to be included in diagnostic testing 
as well as the penetrance of the gene variants chosen. We are 
convinced this information will improve the use of genetic tests to 
detect or exclude Gilbert's syndrome. 

Results 

The distribution of serum bilirubin concentrations in the total 
NORIP population is skewed towards high values, and the 91.5 
percentile of 17.5 umol/L separates the 150 samples with high 
bilirubin from the 150 samples with normal bilirubin (Figure SI). 
The 17.5 umol/L was therefore used as a cut-off to separate the 
two bilirubin groups. Compared to the normal bilirubin group, the 
individuals in the high bilirubin group were younger (mean age 45 
and 53 years, respectively), contained more men (56 and 45%, 
respectively) had fasted for a longer time (mean 1 1 and 9 hours), 
and were mosdy Finns. 
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Figure 1. Cumulative numbers of UGT1A1 rs81 75347 genotypes according to bilirubin concentration in individuals with A) normal 
bilirubin and B) high bilirubin (n = 150 in each group). The thick line represents the TA 7 M"A 7 genotype, the thin line represents the TAe/TA 7 - 
genotype and the stippled line represents the TA 6 /TA 6 -genotype. 
doi:1 0.1 371 /journal.pone.0090248.g001 



The genotype of a repeat variant (rs8 175347) and 7 single 
nucleotide polymorphisms (SNP's) in the UGT1A1 promoter 
(rs2003569, rs4124874, rsl7862878, rsl3009407, rsl7862875, 
rsl7862874, rs7564935), as well as SNPs within 3 other genes, 
the solute carrier organic anion transporter 1B3 (SLC01B3) 



(rs2 1 1 7032, rs 1 7680 137), the ATP-binding cassette transporter C2 
(ABCC2) (rs717620) and the nucleoporin gene (NUP153) 
(rs2328136), were determined in all the 300 samples. The allele 
frequencies of these variants are shown in Table 1 together with 
published frequencies in North-Europeans. None of the SNPs in 



Table 1. Frequencies and chromosomal localization of SNPs in normal and high bilirubin individuals. 



Chromosomal location GRCh37/ 

hg19 Minor allele frequency P-values 



Gene (SNP) 


Chromosome 


Nucleotide 
position 


Normal 
Bilirubin 


High 
Bilirubin 


Bilirubin 
Total" 


Caucasians 


HWE P- 
value" 


P-value c 


UGT1A1 


Rs81 75347 (TA 6 >TA 7 ) 


2 


234668881 


0.30 


0.85 


0.35 


0.31-0.39 b 


0.20 


0.01 


Rs2003569 (OA) 


2 


234667937 


0.12 


0.01 


0.11 


0.13-0.18 


0.51 


0.01 


Rs41 24874 (T>G) 


2 


234665659 


0.43 


0.87 


0.47 


0.39-0.56 


0.95 


0.01 


Rs1 7862878 (G>A) 


2 


234661948 


0.05 


0.01 


0.05 


0.08 


0.11 


0.01 


Rs1 3009407 (OG) 


2 


234652347 


0.23 


0.64 


0.27 


0.23-0.26 


0.20 


0.01 


Rs1 7862875 (G>A) 


2 


234649302 


0.29 


0.77 


0.33 


0.30-0.32 


0.15 


0.01 


Rsl 7862874 (A>G) 


2 


234648746 


0.07 


0.01 


0.07 


0.05-0.06 


0.43 


0.01 


Rs7564935 (G>T) 


2 


234645186 


0.36 


0.77 


0.40 


0.37 


0.50 


0.01 


SLC01B3 


Rs21 17032 (T>C) 


12 


21074122 


0.34 


0.42 


0.35 


0.33-0.37 


0.91 


0.08 


Rs1 76801 37 (OG) 


12 


21015906 


0.14 


0.16 


0.14 


0.17-0.19 


0.43 


0.51 


ABCC2 


Rs717620 (OT) 


10 


101542578 


0.16 


0.18 


0.16 


0.15-0.23 


0.35 


0.54 


NUP153 


Rs2328136 (G>A) 


6 


17709551 


0.21 


0.19 


0.21 


0.16-0.20 


0.27 


0.55 



Minor allele frequencies for Caucasians were obtained from dbSNP at http://www.ncbi.nlm.nih.gov/SNP or from references indicated. 

a Bilirubin Total frequencies and HWE P-values were estimated based on the total NORIP population: The normal and high bilirubin groups represent the 91.5 and 8.5 

percentiles of the total NORIP population, respectively. 

b Ref:[24,25]. 

c Pearson chi-square test was used to compare SNP frequencies between normal- and high bilirubin individuals. P-values were adjusted for multiple comparison by false 

discovery rate (FDR); q<0.05. 

doi:1 0.1 371 /joumal.pone.0090248.t001 
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UGT1A1, SLC01B3, ABCC2 or NUP153 deviated from the 
Hardy-Weinberg equilibrium. 

UGT1A1 Variants 

Among the well-known UGT1A1 TA-repeat variant, the 
TA 6 >TA 7 allele (also named UGT1A1*28) was the only variant 
found in this study, while the rare TA 6 >TA 5 (UGT1A1*36) or 
TA 6 >TA 8 (UGT1A1*37) alleles were not found. In the normal 
bilirubin group, the TA s -containing genotypes (TA 6 /TA 7 and 
TAg/TAg) were the dominant ones (79 and 64 out of 148, 
Figure 1 A) and were the only genotypes found below 11.5 |imol/L 
(51 and 51, respectively). The TA 7 /TA 7 genotype was found in 5 
individuals (3%). The bilirubin in these 5 individuals were all 
above 1 1 .5 umol/L, representing the upper third of the bilirubin 
distribution. By contrast, in the high bilirubin group (Figure IB) 
the TA 7 -genotypes (TA 7 /TA 7 and TA 6 /TA 7 ) clearly dominated 
(109 and 35 out of 148, respectively) and were the only genotypes 
above 24.3 umol/L (62 and 6, respectively). Since the normal and 
high bilirubin groups represent 91.5 and 8.5% of the NORIP 
bilirubin distribution, it can be estimated that the TAg/TAg - 
genotype occurs in 39.8%, TA B /TA 7 in 50.8% and TA 7 /TA 7 in 
9.3% of the NORIP population. Likewise, the penetrance for high 
bilirubin was estimated to be 1 % for the TAg/TAg -genotype, 4% 
for TAg/TA 7 and 67% for TA 7 /TA 7 in the NORIP population. 
From the data of Figure 1, we have estimated the cumulative 
distribution of TAg/TAg, TA 6 /TA 7 and TA 7 /TA7-carriers with 
increasing serum bilirubin concentration (See Table SI). Sub- 
grouped by country, the estimated frequencies of the TA 7 -allele 
were 33, 34, 34 and 40% in Norwegians, Danes, Finns and Swedes 
respectively. However, in the high bilirubin group, Finns 
harboring the TA 7 /TA 7 -genotype were most frequent (45% 
compared to 15-21%). 

Four of the seven UGT1A1 promoter tag-SNPs were in close 
linkage disequilibrium with the TAg>TA 7 -allele (r >0.5). All the 
8 UGT1A1 variants differed considerably in their allele frequen- 
cies between the normal and high bilirubin group (Table 1). From 
the total eight UGT1A1 promoter variants analyzed in this study, 
we identified nine haplotypes. The frequencies of the six 
haplotypes occurring more than once are shown in Table 2. 
The most common haplotypes were haplotypes #1 (estimated 
frequency of 0.52) and #4 (estimated frequency of 0.27), 
harboring the TAg or TA 7 -alleles in the UGT1A1, respectively. 
Haplotype pairs (diplotypes) were assigned to each individual and 
plotted against corresponding bilirubin concentration (Figure 2). 
All diplotypes harboring the TA 7 -allele, except diplotype 2/4 
(homozygous for the rs7564935 T-allele), had significantly higher 
bilirubin concentrations compared to diplotype 1/1 in the TAg/ 
TAg-group. All diplotypes homozygous for TA 7 exhibited higher 
bilirubin than any other diplotypes. An individual effect of 
UGT1A1 rs4124874 (c.-3279T>G) on bilirubin concentration 
was not found in our population, as seen in Figure 2 where 
diplotype 1/2 and 1/3 harbour the rs4124874 G-allele together 
with the rs8 175347 TAg- aUele. 

Other Gene Variants 

Only one SNP in the SLC01B3 gene (rs21 17032 (T>C) tended 
to be more frequent in the high bilirubin group compared to the 
normal bilirubin group (P = 0.08, Table 1). Four haplotypes were 
identified for the two SLC01B3 variants (data not shown), 
however, assigned diplotypes were similar with respect to bilirubin 
concentrations. The allele frequencies of SLC01B3 rs 1 7680 137 
(C>G), ABCC2 rs717620 (C>T) andNUP153 rs2328136 (G>A) 
were equally frequent in both bilirubin groups. 
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Figure 2. The Relationship between median bilirubin concentrations and UGT1A1 diplotypes. UGT1A1 rs8175347genotypes are marked 
in triangles (TA 6 /TA 5 ), crosses (TA 6 /TA 7 ) and circles (TA 7 M"A 7 ). 
doi:1 0.1 371 /journal.pone.0090248.g002 



Multivariate Analysis 

In stepwise multivariate logistic regression analysis of all genetic 
variants together with age, sex, country of origin and fasting time, 
the repeat variants of UGT1A1 TA 6 >TA 7 and SLC01B3 
rs2 1 1 7032 T>C were significantly associated with higher bilirubin 
concentrations (Table 3). Box-plot of bilirubin concentrations for 
the combinations of UGT1A1 TA 6 >TA 7 and SLC01B3 
(rs2 1 1 7032, T>C) genotypes, in the normal and high bilirubin 
groups combined, is shown in Figure 3. Of the Nordic countries, 
the Finns showed the highest odds for high bilirubin levels 
(Table 3). Furthermore, when individuals were stratified according 
to TA 6 - and TA 7 -genotype groups, Finns showed a higher 
frequency of individuals with the TA 6 /TA 7 -genotype in the high 
bilirubin group (47%) compared to the other countries (17-30%). 
On the other side, Norwegians had the highest frequency of 
individuals with the TA 7 /TA 7 -genotype in the low bilirubin group 
(11% compared to 0-3%). 

A multivariate logistic regression analysis was also made with 
the UGT1A1 TAg>TA 7 variant exchanged with the various 
UGT1A1 diplotypes seen in Figure 2 (with 1/1 as the reference 
diplotype). Among individuals with the TA 6 /TA 7 -genotype, 
diplotype 1/6 showed a significantly higher odds ratio for high 
bilirubin levels than any of the other diplotypes (data not shown). 
Diplotype 1/6 is homozygous for the rs7564935 G-allele. 

Discussion 

We confirmed the well-known close association between the 
TA 7 -repeat variant of the UGT1A1 promoter and high serum 
bilirubin concentrations [1—3]. In our population of healthy 
Nordic individuals, this variant allele was present only in the upper 
forty percent of the serum bilirubin concentration distribution and 
was the most dominant repeat variant when serum bilirubin was 
above 25-30 u.mol/L. 



The other UGT1A1 promoter gene variants (tag SNPs) studied 
covaried to a large extent with the UGT1A1 TAfi>TA 7 variant, 
but in multivariate logistic regression analysis their effect upon 
serum bilirubin concentration became non-significant. In previous 
studies, the UGT1A1 rs4124874 (c.-3279T>G) has been found to 
have an individual effect on bilirubin concentration [6,7], 
however, in our study the presence of the rs4 124874 G-allele 
alone was not associated with increased bilirubin concentration. 
Therefore, our results strongly indicate that the TA 7 -allele of 
rs8 175347 is the predominant UGT1A1 variant being responsible 
for increased bilirubin concentrations in our population, and not 
rs4 124874. 

Among the four SNP's located in other genes, only the C/C- 
genotype of SLC01B3 (rs2 1 1 7032) was related to high serum 
bilirubin concentration, which is in agreement with recent 
literature [8]. The previous described association of rs 17680 137 
(SLC01B3), with increased bilirubin concentrations [8], was not 
confirmed in our study. The two SNPs in SLC01B3 are probably 
of less significance in the case of genetic testing for Gilbert's 
syndrome, as a large majority of the TA 6 /TA s -carriers and TA 6 / 
TA 7 -carriers with these SNPs showed bilirubin concentration well 
below 25 |jmol/L. 

We did not find any association between bilirubin levels and 
ABCC2 (rs7 17620) or NUP-153 (rs2328136). ABCC2 mediates 
transport of a wide range of anionic substrates into the bile, 
including conjugated bilirubin [9], and an intronic polymorphism 
(rs7 17620) has been associated with reduced ABCC2 promoter 
activity [10]. However, we found no correlation between this SNP 
and bilirubin concentration in our samples. Furthermore, we 
could not confirm a correlation between NUP-153 (rs2328136) and 
lower bilirubin concentration which was found in an Indian 
population [1 1]. We found a small increase, rather than decrease, 
in bilirubin concentration in our study (data not shown). 

The prevalence of Gilbert's syndrome has repeatedly been 
reported to be in the range of 6-8% [12,13], and our estimate of 
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the TAy/TAy-genotype being found in 9.3% of the Nordic 
population as a whole is in agreement with this statement. 
However, the prevalence of the TA 7 /TA 7 -genotype varies much 
between the Nordic countries, being highest in Finland and lowest 
in Norway. Interestingly, the country-specific serum bilirubin 
upper reference limit reported in the NORIP study varied 
similarly, being 30 u.mol/L in Finland and 25, 24 and 21 |tmol/ 
L in Sweden, Denmark and Norway, respectively. The overall 
upper reference limit was calculated to 25 u.mol/L [14]. Given the 
close relationship between TA 7 /TA 7 and high serum bilirubin 
concentrations, the finding that Finns having high mean bilirubin 
concentration also had the highest prevalence of the TA 7 /TA 7 - 
genotype was not unexpected. In the multivariate logistic 
regression modelling (Table 3), a model without adjustment for 
country influenced the OR for high bilirubin of the TA 7 /TA 7 - 
genotype more than 40%, indicating that the high prevalence of 
the TA 7 /TA 7 -genotype in Finns could be a confounding factor. 
Additionally, the finding that Finns had a significantly higher 
bilirubin concentration in individuals with the TAg/TA 7 -genotype 
compared to Norwegians, indicate that other factors, not 
measured in this study, are more prevalent in Finns. 

When gene testing for Gilbert's syndrome, we will from time to 
time identify the TA B /TA 7 -genotype. What is the significance of 
such a finding? Clearly, the TA6/TA 7 -genotype in general shows 
higher serum bilirubin levels than does TAg/TAg. But neverthe- 
less, the data in Figure 2 does not suggest one specific genetic 
constellation added to TAg/TA 7 that gives rise to very high 
bilirubin concentrations. By not including the TAg/TA 7 -genotype 
as an explanation to Gilbert's syndrome, there will be a definite, 
but small, rate of false negative results. Alternatively, by including 
TA 6 /TA 7 there is an obvious risk of many false positive results. A 
false positive result appears less acceptable than a false negative 
result, since a false positive result wrongly suggests Gilbert's 

Table 3. The Odds ratio (OR) for high bilirubin levels. 



syndrome and might stop the diagnostic process, whereas a false 
negative result will only lead to additional, but probably in the 
end, unnecessary examinations. 

Knowledge about prevalence and penetrance of the gene 
variants responsible for Gilbert's syndrome is a prerequisite for 
useful genetic testing [5]. We suggest that the UGT1A1*28- 
variants are the only gene variants to test for. The prevalence of 
this variant is well known. The penetrance, i.e. the fraction of the 
TA 7 /TA 7 -genotype above a defined bilirubin decision value, is 
now possible to estimate from our distribution data of TAg/TA 6 -, 
TA 6 /TA 7 - and TA 7 /TA 7 -genotypes with increasing serum 
bilirubin concentration. As for the TA 7 /TA 7 -genotype these data 
should be particularly precise above 17.5 u.mol/L, which is the 
concentration range where genetic testing for Gilbert's syndrome 
might be requested. 

In addition to bilirubin, UGT1A1 also combines with several 
medical drugs such as statins, opioids, steroids, thyroid hormones 
and anticancer drugs to form the corresponding glucuronides as 
phase II reactions of drug metabolism [4] . The glucuronides are 
mostly less biologically active and more water-soluble than the 
parent compound. Genetic variants in UGT1A1 can thus possibly 
alter the metabolism of these drugs leading to accumulation of the 
drug in the body causing both increased side effects and possible 
toxicity. This has been shown for the active metabolite SN-38 of 
the anticancer drug irinotecan. Patients who are homozygous for 
the UGT1A1 TA 7 -allele have an increased risk for inrinotecan 
induced toxicity like neutropenia [15]. Because of this, The US 
Food and Drug Administration (FDA) has recommend reduced 
starting dose for patients with this genotype. However, it remains 
to be tested if the different haplotypes found in our study will give 
rise to increased toxicity of SN-38. 







Crude 




Adjusted 




n = 289 


OR (95% CI) 


P-value 


OR (95% CI) 


P- value 


Age (yr) 


0.98 (0.97-0.99) 


<0.0001 


0.97 (0.95-0.99) 


0.007 


Sex 


Female 


1.00 




1.00 




Male 


1 .54 (0.97-2.42) 


0.065 


3.43 (1.48-7.92) 


0.004 


Fasting time (hr) 


1.11 (1.05-1.18) 


<0.0001 


1.15 (1.03-1.27) 


0.011 


Country 


Norway 


1.00 




1.00 




Denmark 


1.37 (0.67-2.81) 


0.394 


3.48 (0.88-1 3.82) 


0.076 


Sweden 


1.75 (0.86-3.57) 


0.123 


3.32 (0.85-13.07) 


0.086 


Finland 


3.32 (1.71-6.45) 


<0.0001 


5.82 (1.56-21.67) 


0.009 


UGT1A1 (Rs81 75347) 


TAe/TAe 


1.00 




1.00 






7.09 (2.39-20.99) 


<0.0001 


1 1 .88 (2.97-47.47) 


<0.0001 


TA 7 /TA 7 


348.8 (90.38-1346.18) 


<0.0001 


1253.75 (196.88-7983.99) 


-C0.0001 


SLC01B3 (Rs21 17032) 


T/T 


1.00 




1.00 




C/T 


1.34 (0.81-2.22) 


0.25 


1.35 (0.55-3.28) 


0.51 


C/C 


2.09 (1.01-4.32) 


0.047 


5.75 (1.65-20.04) 


0.006 



- 2 Log likelihood: 165.36. 

doi:1 0.1 371 /joumal.pone.0090248.t003 
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Figure 3. Box plot of bilirubin for different genotypes of UGT1 A1 TA 6 /TA 7 and SLCOI B3 T>C. The box represents the values from the 25 
to 75% percentile (interquartile range; IQR). The middle line represents the median. The vertical line extends from the minimum to the maximum 
value, excluding outliers (>1.5 IQR of the 75% percentile) which are marked as open circles. 
doi:1 0.1 371 /journal.pone.0090248.g003 



Materials and Methods 

Population 

The Nordic reference interval project (NORIP), organized by 
the Nordic Society of Clinical Chemistry and involving 102 
laboratories from Denmark, Finland, Iceland, Norway and 
Sweden, collected blood samples from 3035 healthy individuals 
during the years 2000-2001 [16]. Each of the laboratories 
included 25-50 persons, evenly distributed in gender and four 
age groups (18-30, 31-50, 51-70 and above 70 years). In serum 
and plasma, bilirubin and 24 other of the most common quantities 
used in medical biochemistry were measured, and reference 
intervals for these quantities were calculated [14,16]. During this 
process, 28 individuals were excluded according to predefined 
exclusion criteria; in most cases, because two or more of an 
individual's 25 serum components were considered outiiers based 
on statistical considerations. All laboratories were invited to send 
their blood samples for establishment of the Nordic Reference 
Interval Project Bio-bank and Database (NOBIDA). Together, 94 
laboratories sent their samples (buffy coat from 5 mL of EDTA- 
blood, serum and plasma) from 2500 individuals. Located in 
Denmark and stored below — 60°C, NOBIDA samples can be 
applied for to correlate reference values to genotypes [17]. At the 
time of our study, buffy coat samples from 1800 individuals were 
available. We considered these 1800 individuals as representative 



for the NORIP population since the samples from the 700 missing 
individuals had been removed by random sampling for use in 
various reference interval studies. For the present study, we 
obtained two groups of samples from NOBIDA based on serum 
bilirubin concentrations: (i) buffy coat from the 150 individuals 
with the highest bilirubin (> 17.5 u.mol/L) and (ii) 150 random 
samples from the remaining 1650 NOBIDA samples 
(< 17.5 nmol/L). We also received all NORIP-data available of 
these 300 samples/individuals. 

Ethics Statement 

Written informed consent for use with NORIP and NOBIDA 
was obtained from all subjects before blood samples were drawn. 
The study was approved by Regional Ethics Committee in all five 
countries. In addition the Norwegian Regional Ethics Committee 
approved this retrospective study. Patient records/information was 
de-identified prior to analysis. 

Gene Variant Selection 

UGT1A1. The TA-repeat variant in the promoter of 
UGT1A1 (rs81 75347) has a well-documented effect on serum 
bilirubin concentration [2] and was therefore included in this 
study. Furthermore, Single Nucleotide Polymorphisms (SNPs) 
located in the upstream region of UGT1A1 exon 1, but 
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downstream of UGT1A3 exon 1 (~30 Kbp), were selected using 
the Tagger program [18] from the HAPMAP website (http:// 
www.hapmap.org) with the CEU dataset (release 24, phase II, 
Nov08). Based on the chosen criteria (pairwise tagging algorithm, 
r 2 >0.9 and minor allele frequency of at least 0.05), seven tag-SNPs 
(rs2003569, rs4124874, rsl7862878, rsl3009407, rsl7862875, 
rsl 7862874, rs7564935) were selected for genotyping. 

Other Genes. Based on a non-systematic review of the 
literature concerning genetic variants affecting bilirubin levels in 
serum, several SNPs in different genes were selected: Rs2117032 
and rsl7680137 [8] located in SLC01B3 that is involved in 
transport of unconjugated bilirubin into the liver, rs7 17620 [10] 
located in ABCC2 that is involved in transport of conjugated 
bilirubin out of the liver, and rs2328136 [1 1], recently found to be 
associated with bilirubin concentration variation and located in 
the gene coding for the nuclear protein NUP53. 

DNA Isolation 

DNA was extracted from 100 jU.1 buffy coat diluted with equal 
amount phosphate-buffered saline (PBS) using the MagNA Pure 
LC DNA Isolation Kit I (cat# 03 003 990 001, Roche, Basel, 
Switzerland) on a MagNA Pure LC Instrument (Roche). The 
DNA yield and purity were measured using a Nanodrop 1000 
(Thermo Scientific, Waltham, USA). The average DNA concen- 
tration was 55.5 ng/|j,l and all samples were diluted 1:5 with 
DNase free water and stored at — 20°C prior to downstream 
applications. 

Genotyping of the UGT1A1 (TA) n Repeat 

The UGT1A1 (TA) n repeat (rs8175347) was genotyped by 
pyrosequencing using a PyroMark Q24 (Qiagen, Venlo, The 
Netherlands). A 169-bp fragment of the promoter region of the 
UGT1A1 gene was amplified using a biotinylated forward primer 
5'-AACAATAACTTGGTGTATCGATTGGT-3' in combina- 
tion with the reverse primer 5 '-CAGGATGGGAC ACC ACTG-3 ' 
using PyroMark PCR Kit (cat. no. 978703, Qiagen). Each PCR 
reaction contained 1 x PyroMark PCR Master Mix, 1 xCoralLoad 
Concentrate, 1 x QjSolution, 4.0 mM MgCl 2 and 0.2 uM of each 
primer in a total volume of 25 ul. The cycling conditions 
comprised 15 min polymerase activation at 95 °C followed by 45 
cycles of 30 seconds at 94°C, 58°C and 72°C and a final extension 
step at 72°C for 10 min. The PCR product (10 ul) was then used 
for pyrosequencing with a reverse sequencing primer 5'- 
TCGCCCTCTCCTACTTATAT-3' on PyroMark Q24 accord- 
ing to the manufacturer's instructions. The samples were analyzed 
using PyroMark Q24 Software v2.0.6. 

Genotyping of Single Nucleotide Polymorphisms (SNPs) 

To detect SNPs in the target nucleic acid sequences, allelic 
discrimination experiments using predesigned TaqMan® SNP 
Genotyping Assays (Life Technologies, Carlsbad, USA) were 
performed on a Viia7 Real-Time PCR System (Life Technolo- 
gies). The allelic discrimination experiment includes three steps; a 
pre-PCR read at 60°C for 30 s to collect baseline fluorescence 
data, an amplification run (quantitative real-time PCR) and a post- 
PCR read at 60°C for 30 s to collect endpoint fluorescence data 
and determine the results for genotyping. The real-time PCR set- 
up was 12.5 u.1 2 x TaqMan Universal PCR Master Mix (cat. 
no. 4304437, Life Technologies), 1.25 ul TaqMan® SNP Geno- 
typing Assay (See Table S2), 4 ul genomic DNA and 7.25 u.1 
DNase free water. The PCR reaction comprised 10 min 
polymerase activation at 95°C followed by 40 cycles of 95°C for 



15 s and 60°C for 1 min. A non-template control and positive 
controls, DNA samples with known genotype from Coriell 
Institute for Medical Research (See Table S3), were included in 
each run. After signal normalization and multicomponent analysis, 
the Viia7 RUO Software vl.2 graphed the results of the allelic 
discrimination runs on a scatterplot. 

Statistical Analysis 

Power calculation: A sample size of 150 individuals with high 
bilirubin levels and 150 individuals with normal bilirubin level 
generates &80% power to detect a genotypic relative risk of ^2 
between the two bilirubin groups under an additive model (for 
allele frequencies of >0.1) (QUANTO, version 1.2). 

Statistical analysis was performed using SPSS 16.0 statistic 
software (SPSS Inc., Chicago, USA). The Pearson chi-square test 
was used for testing categorical variables (SNP frequencies). The 
Benjamini-Hochberg false discovery rate (FDR) procedure was 
used for multiple testing correction (q<0.05) [19]. Univariate and 
multivariate forward conditional logistic regression analyses were 
performed to evaluate the impact of selected polymorphisms in 
UGT1A1, SLC01B3, ABCC2 and NUP153 genes on bilirubin 
concentrations, after adjustments for covariates. The significance 
level of P^O.05 was set for entry into the models and P&0.10 for 
removal from the models. 

For gene locus consisting of two or more SNPs (UGT1A1 and 
SLC01B3), the pairwise correlation coefficients (r 2 ) of linkage 
disequilibrium was calculated and plotted using JLIN (A Java 
based Linkage Disequilibrium plotter, version 1.6.0) [20]. Haplo- 
types and diplotypes for SNPs were estimated by PHASE (version 
2.1) [21,22]. Diplotypes were assigned to each individual with a 
probability cut off of 0.9 for assignment. Hardy-Weinberg 
equilibrium was tested for each individual SNP using either JLIN 
or Michael H. Court's (2005-2008) online calculator [23]. Two- 
sided exact Mann- Whitney tests were used to compare bilirubin 
values between two diplotype groups. 

Supporting Information 

Figure SI Serum bilirubin concentrations in the NORIP 
population (n = 2884). 

(PPTX) 

Table SI Cumulative distribution of UGTTA1 -variants 
with increasing serum bilirubin concentration. 

(DOC) 

Table S2 TaqMan® SNP Genotyping Assays, Life Tech- 
nologies. 

(XLSX) 

Table S3 Positive controls used in genotyping from 
Coriell Institute for Medical Research. 

(XLSX) 
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